<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<HTML
><HEAD
><TITLE
>Links</TITLE
><META
NAME="GENERATOR"
CONTENT="Modular DocBook HTML Stylesheet Version 1.79"><LINK
REL="HOME"
TITLE="DataparkSearch Engine 4.54"
HREF="index.en.html"><LINK
REL="UP"
TITLE="DataparkSearch HTML parser"
HREF="dpsearch-htmlparser.en.html"><LINK
REL="PREVIOUS"
TITLE="META tags"
HREF="dpsearch-htmlparser-meta.en.html"><LINK
REL="NEXT"
TITLE="Comments"
HREF="dpsearch-htmlparser-comments.en.html"><LINK
REL="STYLESHEET"
TYPE="text/css"
HREF="datapark.css"><META
NAME="Description"
CONTENT="DataparkSearch - Full Featured Web site Open Source Search Engine Software over the Internet and Intranet Web Sites Based on SQL Database. It is a Free search software covered by GNU license."><META
NAME="Keywords"
CONTENT="shareware, freeware, download, internet, unix, utilities, search engine, text retrieval, knowledge retrieval, text search, information retrieval, database search, mining, intranet, webserver, index, spider, filesearch, meta, free, open source, full-text, udmsearch, website, find, opensource, search, searching, software, udmsearch, engine, indexing, system, web, ftp, http, cgi, php, SQL, MySQL, database, php3, FreeBSD, Linux, Unix, DataparkSearch, MacOS X, Mac OS X, Windows, 2000, NT, 95, 98, GNU, GPL, url, grabbing"><META
NAME="viewport"
CONTENT="width=device-width, initial-scale=1"></HEAD
><BODY
CLASS="SECT1"
BGCOLOR="#FFFFFF"
TEXT="#000000"
LINK="#0000C4"
VLINK="#1200B2"
ALINK="#C40000"
><!--#include virtual="body-before.html"--><DIV
CLASS="NAVHEADER"
><TABLE
SUMMARY="Header navigation table"
WIDTH="100%"
BORDER="0"
CELLPADDING="0"
CELLSPACING="0"
><TR
><TH
COLSPAN="3"
ALIGN="center"
>DataparkSearch Engine 4.54: Reference manual</TH
></TR
><TR
><TD
WIDTH="10%"
ALIGN="left"
VALIGN="bottom"
><A
HREF="dpsearch-htmlparser-meta.en.html"
ACCESSKEY="P"
>Prev</A
></TD
><TD
WIDTH="80%"
ALIGN="center"
VALIGN="bottom"
>Chapter 4. <SPAN
CLASS="APPLICATION"
>DataparkSearch</SPAN
> HTML parser</TD
><TD
WIDTH="10%"
ALIGN="right"
VALIGN="bottom"
><A
HREF="dpsearch-htmlparser-comments.en.html"
ACCESSKEY="N"
>Next</A
></TD
></TR
></TABLE
><HR
ALIGN="LEFT"
WIDTH="100%"></DIV
><DIV
CLASS="SECT1"
><H1
CLASS="SECT1"
><A
NAME="HTMLPARSER-LINKS"
>4.4. Links</A
></H1
><P
>HTML parser understand the following links:</P
><P
></P
><UL
><LI
><P
>&lt;A HREF="xxx"&gt;</P
><P
>&lt;A HREF="xxx" DATA-EXPANDED-URL="yyy" DATA-ULTIMATE-URL="zzz"&gt;</P
><P
>Attributes priority in link selection: data-ultimate-url, data-expanded-url, href.</P
></LI
><LI
><P
>&lt;IMG SRC="xxx"&gt;</P
></LI
><LI
><P
>&lt;LINK HREF="xxx"&gt;</P
></LI
><LI
><P
>&lt;FRAME SRC="xxx"&gt;</P
></LI
><LI
><P
>&lt;AREA HREF="xxx"&gt;</P
></LI
><LI
><P
>&lt;BASE HREF="xxx"&gt;
<DIV
CLASS="NOTE"
><BLOCKQUOTE
CLASS="NOTE"
><P
><B
>Note: </B
>If BASE HREF
value has incorrectly formed URL, current one will be used instead to
compose relative links.</P
></BLOCKQUOTE
></DIV
>
				</P
></LI
></UL
><P
><A
NAME="AEN3331"
></A
>
However, you can specify the list of HTML which would be omitted in new href lookup with <B
CLASS="COMMAND"
>SkipHrefIn</B
> command.
<PRE
CLASS="PROGRAMLISTING"
>SkipHrefIn "img, link, script"</PRE
></P
><P
><A
NAME="AEN3337"
></A
>
By default, <SPAN
CLASS="APPLICATION"
>DataparkSearch</SPAN
> does not follow links with rel=nofollow attribute specified.
But you can alter this behaviour with <B
CLASS="COMMAND"
>"DisableRelNoFollow yes"</B
> command. You need to put this command in your <TT
CLASS="FILENAME"
>indexer.conf</TT
> file.</P
></DIV
><DIV
CLASS="NAVFOOTER"
><HR
ALIGN="LEFT"
WIDTH="100%"><TABLE
SUMMARY="Footer navigation table"
WIDTH="100%"
BORDER="0"
CELLPADDING="0"
CELLSPACING="0"
><TR
><TD
WIDTH="33%"
ALIGN="left"
VALIGN="top"
><A
HREF="dpsearch-htmlparser-meta.en.html"
ACCESSKEY="P"
>Prev</A
></TD
><TD
WIDTH="34%"
ALIGN="center"
VALIGN="top"
><A
HREF="index.en.html"
ACCESSKEY="H"
>Home</A
></TD
><TD
WIDTH="33%"
ALIGN="right"
VALIGN="top"
><A
HREF="dpsearch-htmlparser-comments.en.html"
ACCESSKEY="N"
>Next</A
></TD
></TR
><TR
><TD
WIDTH="33%"
ALIGN="left"
VALIGN="top"
>META tags</TD
><TD
WIDTH="34%"
ALIGN="center"
VALIGN="top"
><A
HREF="dpsearch-htmlparser.en.html"
ACCESSKEY="U"
>Up</A
></TD
><TD
WIDTH="33%"
ALIGN="right"
VALIGN="top"
>Comments</TD
></TR
></TABLE
></DIV
><!--#include virtual="body-after.html"--></BODY
></HTML
>
